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Abstract 

A recent papc^r [1] proposed a provably optimal, polyno- 
mial time method for performing near-isometric point pat- 
tern matching by means of exact probabilistic inference 
in a chordal graphic;al model. Their fundamental result 
is that the chordal graph in question is shown to be glob- 
ally rigid, implying that exact inference provides the same 
matching solution as exact inference in a complete graph- 
ical model. This implies that the algorithm is optimal 
when there is no noise in the point patterns. In this pa- 
per, we present a new graph which is also globally rigid 
but has an advantage over the graph proposed in [1]: its 
maximal clique size is smaller, rendering inference signif- 
icantly more efficient. However, our graph is not chordal 
and thus standard Junction Tree algorithms cannot be di- 
rectly applied. Nevertheless, we show that loopy belief 
propagation in such a graph converges to the optimal so- 
lution. This allows us to retain the optimality guarantee in 
the noiseless case, while substantially reducing both mem- 
ory requirements and processing time. Our experimental 
results show that the accuracy of the proposed solution is 
indistinguishable from that of [1] when there is noise in 
the point patterns. 



1 Introduction 

Point pattern matching is a fundamental problem in pat- 
tern recognition, and has been modeled in several different 
forms, depending on the demands of the application do- 
main in which it is required [2,3]. A classic formulation 
which is realistic in many practical scenarios is that of 
near-isometric point pattern matching, in which we are 
given both a "template" (T) and a "scene" (<S) point pat- 
terns, and it is assumed that S contains an instance of 
T (say T'), apaxt from an isometric transformation and 
possibly some small jitter in the point coordinates. The 
goal is to identify T' in S and find which points in T 
correspond to which points in T'. 

Recently, a method was introduced which solves this 
problem efficiently by means of exact belief propagation in 
a certain graphical model [1]. The approach is appealing 
because it is optimal not only in that it consists of exact 
inference in a graph with small maximal clique size (= 4 
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for matching in R^), but that the graph itself is optimal. 
There it is shown that the maximum a posteriori (MAP) 
solution in the sparse and tractable graphical model where 
inference is performed is actually the same MAP solution 
that would be obtained if a fully connected model (which 
is intractable) could be used. This is due to the so-called 
global rigidity of the chordal graph in question: when the 
graph is embedded in the plane, the lengths of its edges 
uniquely determine the lengths of the absent edges (i.e. the 
edges of the graph complement) [4]. The computational 
complexity of the optimal point pattern matching algo- 
rithm is then shown to be O(nm^) (both in terms of pro- 
cessing time and memory requirements), where n is the 
number of points in the template point pattern and m is 
the number of points in the scene point pattern (usually 
with m > n in applications). This reflects precisely the 
computational complexity of the Junction Tree algorithm 
in a chordal graph with 0{n) nodes, 0(m) states per node 
and maximal cliques of size 4. The authors present exper- 
iments which give evidence that the method substantially 
improves on well-known matching techniques, including 
Graduated Assignment [5]. 

In this paper, we show how the same optimality proof 
can be obtained with an algorithm that runs in O(nm^) 
time per iteration. In addition, memory requirements are 
precisely decreased by a factor of m. We are able to 
achieve this by identifying a new graph which is globally 
rigid but has a smaller maximal clique size: 3. The main 
problem we face is that our graph is not chordal, so in 
order to enforce the running intersection property for ap- 
plying the Junction Tree algorithm the graph should first 
be triangulated; this would not be interesting in our case, 
since the resulting triangulated graph would have larger 
maximal clique size. Instead, we show that belief prop- 
agation in this graph converges to the optimal solution, 
although not necessarily in a single iteration. In practice, 
we find that convergence occurs after a small number of 
iterations, thus improving the running-time by an order 
of magnitude. We compare the performance of our model 
to that of [1] with synthetic and real point sets derived 
from images, and show that in fact comparable accuracy 
is obtained while substantial speed-ups are observed. 



2 Background 

We consider point matching problems in M^. The problem 
we study is that of near-isometric point pattern matching 
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(as defined above), i.e. one assumes that a near-isometric 
instance (T') of tiie template (T) is somewhen "hidden" 
in the scene (5). By "near-isometric" it is meant that 
the relative distances of points in T are approximately 
preserved in T'. For simplicity of exposition we assume 
that T, T', and S are ordered sets (their elements are 
indexed). Our aim is to find a map x : T S with image 
T' that best preserves the relative distances of the points 
in T and T', i.e. 

X* ^i,rgmm\\D{T)-D{x{T))\\l, (1) 

X 

where D{T) is the matrix whose {i^iY^ entry is the Eu- 
clidean distance between points indexed by i and j in T . 
Note that finding x* is inherently a combinatorial opti- 
mization problem, since T' is itself a subset of S, the 
scene point pattern. In [1], a generic point in T is mod- 
eled as a random variable {Xi), and a generic point in S is 
modeled as a possible realization of the random variable 
{xi). As a result, a joint realization of all the random vari- 
ables corresponds to a match between the template and 
the scene point patterns. A graphical model (see [6,7]) is 
then defined on this set of random variables, whose edges 
are set according to the topology of a so-called 3-tree graph 
(any 3-tree that spans T). A 3-tree is a graph obtained 
by starting with the complete graph on 3 vertices, K^, 
and then adding new vertices which are connected only to 
those same 3 vertices.^ Figure 1 shows an example of a 3- 
tree. The reasons claimed in [1] for introducing 3-trees as 
a graph topology for the probabilistic graphical model are 
that (i) 3-trees are globally rigid in the plane and (ii) 3- 
trees are chordaP graphs. This implies (i) that the 3-tree 
model is a type of graph which is in some sense "optimal" 
(in a way that will be made clear in the next section in 
the context of the new graph we propose) and (ii) that 3- 
trees have a Junction Tree with fixed maximal clique size 
(= 4); as a result it is possible to perform exact inference 
in polynomial time [1]. 

Potential functions are defined on pairs of neighboring 
nodes and are large if the difference between the distance 
of neighboring nodes in the template and the distance be- 
tween the nodes they map to in the scene is small (and 
small if this difference is large). This favors isometric 
matchings. More precisely, 

ipijiXi = Xi,Xj = Xj) = f{d{Xi,Xj) - d{xi,Xj)) (2) 

where /(•) is typically some unimodal function peaked at 
zero (e.g. a zero-mean Gaussian function) and d{-, •) is the 
Euclidean distance between the corresponding points (for 
simplicity of notation we do not disambiguate between 
random variables and template points, or realizations and 

^Technically, connecting new vertices to the 3 nodes of the orig- 
inal K3 graph is not required: it suffices to connect new vertices to 
any existent 3-clique. 

A chordal graph is one in which every cycle of length greater than 
3 has a chord. A chord of a cycle is an edge not belonging to the 
cycle but which connects two nodes in the cycle (i.e. a "shortcut" 
in a cycle). 




Figure 1 : An example of a 3-tree 

scene points). For the case of exact matching, i.e. when 
there exists an x* such that the minimal value in (1) is 
zero, then /(•) = S{-) (where S{-) is just the indicator 
function l{o}('))- The potential function of a maximal 
clique (^) is then simply defined as the product of the 
potential functions over its 6 {— C|) edges (which will 
be maximal when every factor is maximal). It should be 
noted that the potential function of each edge is included 
in no more than one of the cliques containing that edge. 

For the case of exact matching (i.e. no jitter), it is shown 
in [1] that running the Junction Tree algorithm on the 3- 
tree graphical model with /(•) = 6{-) will actually find a 
MAP assignment which coincides with x* , i.e. such that 
\\D{T) - L>(a;*(r))||2 = 0. This is due to the "graph rigid- 
ity" result , which tells us that equality of the lengths of the 
edges in the 3-tree and the edges induced by the matching 
in T' is sufficient to ensure the equality of the lengths of all 
pairs of points in T and T'. This will be made technically 
precise in the next section, when we prove an analogous 
result for another graph. 

3 An Improved Graph 

Here we introduce another globally rigid graph which has 
the advantage of having a smaller maximal clique size. 
Although the graph is not chordal, we will show that exact 
inference is tractable and that we will indeed benefit from 
the decrease in the maximal clique size. As a result we will 
be able to obtain optimality guarantees like those from [1]. 
Our graph is constructed using Algorithm 1. 



Algorithm 1 Graph Generation for Q 

1 Create a cycle graph by traversing all the nodes in T 
(in any order) 

2 Connect all nodes whose distance in this cycle graph 
is two (i.e. connect each node to its neighbor's neighbor) 



2 




Figure 2: The general form of the graph we consider, with 
n nodes. 

This algorithm will produce a graph hke the one shown 
in Figure 2. We will denote by G the set of graphs that 
can be generated by Algorithm 1. G = {V,E) will denote 
a generic graph in Q. 

In order to present our results we need to start with the 
definition of a globally rigid graph: 

Definition 1. A planar graph embedding G is said to be 
globally rigid in if the lengths of the edges uniquely 
determine the lengths of the edges of the graph complement 
ofG. 

So our statements are really about graph embeddings in 
M.'^ , but for simplicity of presentation we will simply refer 
to these embeddings as "graphs" . 

This means that there are no degrees of freedom for the 
absent edges in the graph: they must all have specified 
and fixed lengths. To proceed we need a simple definition 
and some simple technical lemmas. 

Definition 2. A set of points is said to be in general 
position in if no 3 points lie in a straight line. 

Lemma 3. Given a set of points in general position in , 
if the distances from a point P to two other fixed points 
are determined then P can be in precisely two different 
positions. 

Proof. Consider two circles, each centered at one of the 
two reference points with radii equal to the given distances 
to point P. These circles intersect at precisely two points 
(since the 3 points axe not coUinear). This proves the 
statement. □ 

The following lemma follows directly from lemma 1 in 
[1], and is stated without proof. 

Lemma 4. Given a set of points in general position in 
, if the distances from a point P to three other fixed 
points are determined then the position of P is uniquely 
determined. 

We can now present a proposition. 



Proposition 5. Any graph G £ Q arising from Algorithm 
1 is globally rigid in the plane if the nodes are in general 
position in the plane. 

Proof. Define a reference frame S where points 1, 2 and 
n have specific coordinates (we say that the points are 
"determined" ) . We will show that all points then have de- 
termined positions in S and therefore have determined rel- 
ative distances, which by definition implies that the graph 
is globally rigid. 

We proceed by contradition: assume there exists at least 
one undetermined point in the graph. Then we must have 
an undetermined point i such that i — 1 and i ~ 2 are 
determined (since points 1 and 2 are determined). By 
virtue of lemma 4, points i -I- 1 and i + 2 must then be also 
undetermined (otherwise point i would have determined 
distances from 3 determined points and as a result would 
be determined). 

Let us now assume that only points i^i + l,i + 2 are un- 
determined. Then the only possible realizations for points 
i, i + 1 and i + 2 are their reflections with respect to the 
straight line which passes through points i — 1 and i -I- 3, 
since these are the only possible realizations that maintain 
the rigidity of the triangles (« — 1, i, i-|-l), (i, i-t-1, i-t-2), (z-|- 
1 , i + 2, i H- 3) , since i — 1 and i + 2> are assumed fixed. How- 
ever, since i-\-A and i — 2 are also fixed by assumption, this 
would break the rigidity of triangles {i + 2,i + 'i,i + 4) and 
(i,i — — 2). Therefore z -I- 3 cannot be determined. This 
can then be considered as the base case in an induction 
argument which goes as follows. Assume only i, . . . ,i + p 
are undetermined. Then, by reflecting these points over 
the line that joins i — 1 and i + p + I (which are fixed 
by assumption), we obtain the only other possible real- 
ization consistent with the rigidity of the triangles who 
have all their vertices mi— l,...,i + p+ l. However, 
this realization is inconsistent with the rigidity of trian- 
gles {i + p,i + p+ l,i + p + 2) and (i, i — 1, i — 2), therefore 
i + p + I must not be determined and by induction any 
point j such that j > i + 2 must not be determined, which 
contradicts the assumption that n is determined. As a 
result, the assumption that there is at least one undeter- 
mined point in the graph is false. This implies that the 
graph has all points determined in S, and therefore all rel- 
ative distances are determined and by definition the graph 
is globally rigid. This proves the statement. □ 

Although we have shown that graphs G £ Q are globally 
rigid, notice that they are not chordal. For the graph in 
Figure 2, the cycles (1, 3, 5 ... , n—1, 1) and (2,4, 6 . . . , n, 2) 
have no chord. Moreover, triangulating this graph in order 
to make it chordal will necessarily increase (to at least 4) 
the maximal clique size (which is not sufficient for our 
purposes since we arrive at the case of [1]). 

Instead, consider the clique graph formed hy G € Q. 
If there are n nodes, the clique graph will have cliques 
(l,2,3),(2,3,4),...,(n-2,n-l,n),(n-l,n,l),(n,l,2). 
This clique graph forms a cycle, which is depicted in Figure 
^1 

•^Note that if we connected every clique whose nodes intersected, 
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Figure 3: The chque graph obtained from the graph in 
Figure 2. 

We now draw on results first obtained by Weiss [8] , and 
confirmed elsewhere [9]. There it is shown that, for graphi- 
cal models with a single cycle, belief propagation converges 
to the optimal MAP assignment, although the computed 
marginals may be incorrect. Note that for our purposes, 
this is precisely what is needed: we are after the most likely 
joint realization of the set of random variables, which cor- 
responds to the best match between the template and the 
scene point patterns. Max-product belief propagation [10] 
in a cycle graph like the one shown in Figure 3 amounts 
to computing the following messages, iteratively: 

mi^i+i{Ui n U^+l) 

= max <i/{U^)m,^i^i{U^nU^-i), (3) 

Ui\Ui+i 

where Ui is the set of singleton variables in clique node i, 
"^(Ui) the potential function for clique node i and m^^i+i 
the message passed from clique node i to clique node i + 
1. Upon reaching the convergence monitoring threshold, 
the optimal assignment for singleton variable j in clique 
node i is then computed by argmax^.^^ {Ui)'mi^i^i{Uin 
Ui_i)mi+i^i{Ui n U,+i). 

Unfortunately, the above result is only shown in [8] when 
the graph itself forms a cycle, whereas we only have that 
the clique graph forms a cycle. However, it is possible to 
show that the result still holds in our case, by consider- 
ing a new graphical model in which the cliques themselves 
form the nodes, whose cliques are now just the edges in the 
clique graph. The result from [8] can now be used to prove 
that belief propagation in this graph converges to the op- 
timal MAP assignment, which (by appropriately choosing 
potential functions for the new graph), implies that belief 
propagation should converge to the optimal solution in the 
original graph also. 

the clique graph would no longer form a cycle; here we have only 
formed enough connections so that the intersection of any two cliques 
is shared by the cliques on at least one path between them (similar 
to the running intersection property for Junction Trees). 



To demonstrate this, we need not only show that belief 
propagation in the new model converges to the optimal 
assignment, but also that belief propagation in the new 
model is equivalent to belief propagation in the original 
model. 

Proposition 6. The original clique graph (Figure 3) can 
be transformed into a model containing only pairwise po- 
tentials, whose optimal MAP assignment is the same as 
the original model's. 

Proof. Consider a clique "node" Ci = {Xi, X2, X^) (in 
the original graph), whose neighbors share exactly two of 
its nodes (for instance C2 = {X2,X:},X4)). Where the 
domain for each node in the original graph was simply 
{1,2. . . \S\}, the domain for each "node" in our new graph 
simply becomes {1,2... \S\} . 

In this setting, it is no longer possible to ensure that 
the assignment chosen for each "node" is consistent with 
the assignment to its neighbor - that is, for an assignment 
{xi,X2,xs) to Ci, and (xj, 2:3, 0:4) to C2, we cannot guar- 
antee that X2 = X2, or — x';^. Instead, we will simply 
define the potential functions on this new graph in such a 
way that the optimal MAP assignment implicitly ensures 
this equality. Specifically, we shall define the potential 
functions as follows: for two cliques C/ = {Ii, 12,13) and 
Cj = (-^i, -^2, -/s) in the original graph (which share two 
nodes, say {12,13) and (Ji, J2)), define the pairwise po- 
tential for the clique {^'j j) in the new graph as follows: 

*/,j(*(123)>i(123)) = 

f ^'/(«1,«2,«3) if («2,«3) = (il, ^2), 

1^ p otherwise 

Where "ifj is simply the clique potential for the clique 
in the original graph; 1(123) G domain(/i) x domain(/2) x 
domain(/3) (sim. for j(i23))- That is, we are setting the 
pairwise potential to simply be the original potential of 
one of the cliques if the assignments are compatible, and p 
otherwise. If we were able to set p = 0, we would guarantee 
that the optimal MAP assignment was exactly the optimal 
MAP assignment in the original graph - however, this is 
not possible, since the result of [9] only holds when the 
potential functions have finite dynamic range. Hence we 
must simply choose p sufficiently small so that the optimal 
MAP assignment cannot possibly contain an incompatible 
match - it is clear that this is possible, for example p = 
(Hcmaxxc ^'c(xc))"^ will do. 

The result of [8] now implies that belief propagation in 
this graph will converge to the optimal MAP assignment, 
which we have shown is equal to the optimal MAP assign- 
ment in the original graph. □ 

Proposition 7. The messages passed in the new model 
are equivalent to the messages passed in the original model, 
except for repetition along one axis. 

Proof. We use induction on the number of iterations. 
First, we must show that the outgoing messages are the 
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same during the first iteration (during which the in- 
coming messages are not included). We will denote by 
'^iXuX2,X3)^iX2,X3,Xi) tlie message from {Xi,X2,X3) to 
(X2,X3, X4) during the z*'' iteration: 



'^lxi,X2,X3)^iX2,X3,X4)i^2,Xs) 

= niax*(Xi,x2,X3)(a;i,a;2,a;3), (5) 

^1 



"^(^(123) ,^(234) )^(^(234) ,^(346) ) (^123 , 2:234) 

= maxjf(^,3, «'^(^^3^,X(,34)(^(123)'a^(234)) 

= 1 X maxxi *(Xi,X2,X3)(a;i,a;2,a;3) (6) 
= ™(Xi,X2,X3)i-^(X2,X3,X4)(^2,a;3). 

This result only holds due to the fact that p will never 
be c;hosen when maximizing along any axis. We now have 
that the messages are equal during the first iteration (the 
only difference being that the message for the new model 
is repeated along one axis).^ Next, suppose during the 
(n — 1)"* iteration, the messages (for both models) are 
equal to k{xi,X2)- Then for the n*'* iteration we have: 



nTiXi,X2,X3)^{X2,X3,X4){X2,X3) 

= irmx{'^(^Xi,X2,X3){xi,X2,X3)K{xi,X2)} , (7) 

Xi 



m, 



Xl-I 231 :Xl' 



. (123), ^(234))'-(X(234).X(345)) (^123, 2:234) 

maxx(i,3) {*X(i23),X(234)(^(i23).a^(234))K(a:i,a;2)} 
1 X maxxi {^{Xi,X2,X3){xi,X2,X3)k{xi,X2)} 

'^{X^,X2,Xs)^{X2,X3,Xi)i^2,X3)■ 



(8) 



Hence the two message passing schemes are equivalent by 
induction. □ 

We can now state our main result: 

Theorem 8. Let G G G be a graph generated according 

to the procedure described in Algorithm 1. Assume that 
there is a perfect isometric instance ofT within the scene 
point pattern S. Then the MAP assignment x* obtained 
by running belief propagation over the clique graph derived 



from G IS such that \\D{T) - D{x*{T))\\l 



0. 



Proof. For the exact matching case, wc simply set /(•) = 
5{-) in (2). Now, for a graph G ^ Q given by Algorithm 1, 
the clique graph will be simply a cycle, as shown in Figure 
3, and following propositions 6 and 7 as well as the already 



To be completely precise, the message for the new model is 
actually a function of only a single variable - ^(234)- By "re- 
peated along one axis", we mean that for any given (a;2, 0:3, 354) G 
domain(X2) X domain(X3) X domain(Jf4), the message at this point 
is independent of X4, , which therefore has no effect when maximizing. 



mentioned result from [8] , belief propagation will find the 
correct MAP assignment a;*, i.e. 



X* = argmaxPG(-'^ = x) 

X 

= argmax JJ^ 6{d{Xi,Xj) — d{xi,Xj)), 



(9) 



where Pg is the probability distribution for the graphical 
model induced by the graph G. Now, we need to show that 
X* also maximizes the criterion which ensures isometry, 
i.e. we need to show that the above implies 



X* = argmax Pcompiete(-''^ = x) 

X 

= argmax jQ i5(d(Xj, Xj) — d{xi,Xj)), 



(10) 



where Pcompiete is the probability distribution of the 

graphical model induced by the complete graph. Note 
that X* must be such that the lengths of the edges in 
E are precisely equal to the lengths of the edges in Et' 
(i.e. the edges induced in S from E by the map X = x*). 
By the global rigidity of G, the lengths of E must then 
be also precisely equal to the lengths of Et' ■ This implies 
t'^^t Il^,r.i^,,)eESidi^i^XJ) - dix*,x*)) = 1. Since (10) 
can be expanded as 



argmax-^ YJ^ S{d{Xi,Xj) — d{xi,Xj)) 



H d{d{Xi,Xj)-d{xi,xj))], 



(11) 



hj-(hj)eE 



it becomes clear that x* will also maximize (10). This 
proves the statement. □ 

4 Experiments 

We have set up a series of experiments comparing the pro- 
posed model to that of [1] . Here we compare graphs of the 
type shown in Figure 2 to graphs of the type shown in Fig- 
ure 1. 

The parameters used in our experiments are as follows: 

e - this parameter controls the noise-level used in our 
model. Here we apply Gaussian noise to each of the points 
in T (with standard deviation e in each axis). We have 
run our experiments on a range of noise levels between 
and 4/256 (where the original points in T are chosen 
randomly between and 1) . Note that this is the same as 
the setting used in [1]. 

Potential functions ipij(Xi = Xi,Xj = Xj) = 
f{d{Xi,Xj) — d{xi,Xj)) - as in [1], we use a Gaussian 
function, i.e. exp ^ ^ -pj^g parameter a 

is fixed beforehand as u = 0.4 for the synthetic data, and 
cr = 150 for the real- world data (as is done in [1]). 

Dynamic range - as mentioned in section 2, the po- 
tential function ^'(x) is simply the product of tjjij{Xi = 
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Xi,Xj = Xj) for all edges [i,]) in x (here each max- 
imal cUque X contains 3 edges). The dynamic range 
of a function is simply defined as its maximum value 
divided by its minimum value (i.e. ™tn''^(x) )• 
der to prove convergence of our model, it is necessary 
that the dynamic range of our potential function is fi- 
nite [9]. Therefore, rather than using ^'(x) directly, we 
use *'(x) = (1/d) + (1 - l/c;)*(x). This ensures that the 
dynamic range of our model is no larger than d, and that 
— > ^' as d ^ 00. In practice, we found that varying this 
parameter did not have a significant effect on convergence 
time. Hence we simply fixed a large finite value {d = 1000) 
throughout. 

MSE-cutofF in order to determine the point at 
which belief propagation has converged, we compute the 
marginal distribution of every clique, and compare it to 
the marginal distribution after the previous iteration. Be- 
lief propagation is terminated when this mean-squared er- 
ror is less than a certain cutoff value for every clique in the 
graph. When choosing the mode of the marginal distribu- 
tions after convergence, if two values differ by less than 
the square-root of this cutoff, both of them are considered 
as possible MAP-estimates (although this was rarely an 
issue when the cutoff was sufficiently small). We found 
that as |iS| increased, the mean squared error between it- 
erations tended to be smaller, and therefore that smaller 
cutoff values should be used in these instances. Indeed, 
although the number of viable matches increases as |«S| 
increases, the distributions increase in sparsity at an even 
faster rate - hence the distributions tend to be less peaked 
on average, and changes are likely to have less effect on 
the mean squared error. Hence we decreased the cutoff 
values by a factor of 10 when |iS| > 30.^ 

The clique graph in which messages are passed by our 
belief propagation algorithms is exactly that shown in Fig- 
ure 3. It is worth noting, however, that we also tried 
running belief propagation using a clique graph in which 
messages were passed between all intersecting cliques; we 
found that this made no difference to the performance of 
the algorithm,^ and we have therefore restricted our ex- 
periments to the clique graph of Figure 3 in respect of its 
optimality guarantees. 

For the sake of running-time comparison, we imple- 
mented the proposed model, as well as that of [1] using 
the Elefant belief propagation libraries in Python.^ How- 
ever, to ensure that the results presented are consistent 
with those of [1], we simply used code that the authors 
provided when reporting the matching accuracy of their 
model. 

Figure 4 compares the matching accuracy of our model 
with that of [1] for |<S| = 10,20,30, and 40 (here we fix 



^Note that this is not a parameter in [1], in which only a single 
iteration is ever required. 

^Apart from one slight difference: including the additional edges 
appears to provide convergence in fewer iterations. However, since 
the number of messages being passed is doubled, the overall running- 
time for both clique graphs was ultimately similar. 

'^http : //elefant . developer . nicta . com . au/ 



[T[ = 10). The performanc;e of our algorithm is indistin- 
guishable from that of the Junction Tree algorithm. 

Figures 5 and 6 show the running-time and matching ac- 
curacy (respectively) of our model, as we vary the mean- 
squared error cutoff. Obviously, it is necessary to use a 
sufficiently low cutoff in order to ensure that our model 
has converged, but choosing too small a value may ad- 
versely effect its running-time. We found that the mean- 
squared error varied largely during the first few iterations, 
and we therefore enforced a minimum number of iterations 
(here we chose at least 5) in order to ensure that belief- 
propagation was not terminated prematurely. Figure 5 
reveals that the running-time is not significantly altered 
when increasing the MSE-cutoff - revealing that the model 
has almost always reached the lower cutoff value after 5 
iterations (in which case we should expect a speed-up of 
precisely |<S|/5). Furthermore, decreasing the MSE-cutoff 
does not significantly improve the matching accuracy for 
larger point sets (Figure 6), so choosing the lower cutoff 
does little harm if running-time is major a concern. Al- 
ternately, the Junction Tree model (which only requires a 
single iteration), took (for 5 = 10 to 40), 3, 44, 250, and 
1031 seconds respectively. These models differ only in the 
topology of the network (see section 3), and the size of 
the messages being passed; our method easily achieves an 
order of magnitude improvement for large networks.^ 

Finally, we present matching results using data from 
the CMU house sequence.*^ In this dataset, 30 points 
corresponding to certain features of the house are avail- 
able over 111 frames. Figure 7 shows the 71st and the 
last (111th) frames from this dataset. Overlayed on these 
images are the 30 significant points, together with the 
matches generated by the Junction Tree algorithm and 
our own (matching the first 20 points); in this instance, 
the Junction Tree algorithm correctly matched 16 points, 
and ours 17. Figure 8 shows how accurately points be- 
tween frames are matched as the baseline (separation be- 
tween frames) varies. We also vary the number of points 
in the template set (|T|) from 15 to 30. Our model seems 
to outperform the Junction Tree model for small base- 
lines, whereas for large baselines and larger point sets the 
Junction Tree model seems to be the best. It is however 
difficult to draw conclusions from both models in these 
cases, since they are designed for the near-isometric case, 
which is violated for larger baselines. 

5 Conclusions 

We have shown that the near-isometric point pattern 
matching problem can be solved much more efficiently 
than what is currently reported as the state-of-the-art, 
while maintaining the same optimality guarantees for the 
noiseless case and comparable accuracy for the noisy case. 

^In fact, the speed-up appears to be more than an order of mag- 
nitude for the large graphs, which is likely a side effect of the large 
memory requirements of the Junction Tree algorithm. 

^http : //vase . ri . emu . edu/ idb/html/motion/house/index . html. 
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Figure 4: Matching accuracy of our model against that 
of [1]. The performance of our model is statistically in- 
distinguishable from that of [1] for all noise levels. The 
error bars indicate the average and standard error of 50 
experiments. 



Figure 5: Running-time of our model as the jitter varies, 
for different MSE-cutoffs. Speed-ups are almost exactly 
one order of magnitude. 
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Figure 6: Matching accuracy of our model as the MSE- 
cutofF varies. This figure suggests that the higher cutoff 
value should be sufficient when matching larger point sets. 



Figure 7: Top: points matched using the Junction Tree 
algorithm (the points in the left frame were matched to 
the corresponding points in the right frame); 16 points 
are correctly matched by this algorithm. Bottom: points 
matched using our algorithm; 17 points are correctly 
matched. 



This was achieved by identifying a new type of graph with 
the same global rigidity property of previous graphs but 
in which exact inference is far more efficient. Although 
exact inference is not directly possible by means of the 
Junction Tree algorithm since the graph is not chordal, 
what we managed to show is that loopy belief propaga- 
tion in such graph does converge to the optimal solution 
in a sufficiently small number of iterations. In the end, 
the advantage of the smaller clique size of our model dom- 
inates the disadvantage caused by the need for more than 
a single iteration. 
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Figure 8: Matching accuracy of our model using the 
"house" dataset, as the baseline (separation between 
frames) varies. 
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